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(57) Abstract 

An algorithm is described for sparsely updating the Filter coeffi- 
cients of an adaptive filter of an adaptive audio system to facilitate a 
high sampling rate without excessive processing capacity. Computer si- 
mulations are used to illustrate the potential for using an inverse filter 
matrix to manipulate the position of stereophonic images. The possibil- 
ities suggested could be used in conjunction with existing stereophonic 
recordings in order to improve stereophonic imaging in situations 
where the relative locations of loudspeakers and listener are con- 
strained to be non-ideal. In one arrangement, real sound sources (SI, 
S2) provide to the listener the impression that the sources are at more 
widely-spaced positions (VI, V2). In another arrangement, employing 
four sound sources (SI, S2, S3, S4) the impression is created at two lis- 
tener locations (Ml, M2; M3, M4) that the sound is coming from a 
pair of sound sources positioned symmetrically of those listener loca- 
tions. 
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ADAPTIVE AUDIO SYSTEMS AND SOUND REPRODUCTION SYSTEMS 

This invention relates to adaptive audio systems, and to sound 
reproduction systems incorporating multi-channel signal processing 
techniques, 

SUMMARY OF THE INVENTION 

Broadly speaking, the first three aspects of the invention are 
concerned with sound reproduction systems arranged to generate 
'virtual source locations' of sound at positions other than those of 
actual loudspeakers employed to reproduce a sound field. 

According to one aspect of the present invention a sound 
reproduction system comprises a plurality of loudspeakers which are 
arranged asymmetrically with respect to a listener location, the 
loudspeakers being driven through a filter means by a plurality of 
channels of a sound recording, the filter characteristics of the 
filter means being so chosen as to create at the listener location a 
local sound field which is substantially equivalent to the local 
field that would result from playing the sound recording through a 
plurality of loudspeakers driven without filters and positioned at 
virtual source locations that are substantially symmetrically 
positioned with respect to the listener location. 

Thus, the filter means is arranged to compensate for the asymmetric 
positioning of the loudspeakers with respect to the listener 
location. 

For example, with reference to Figure 1, a transfer function matrix 
C(z) of electro-acoustic transfer functions relates loudspeaker 
inputs to the outputs of microphones placed at the location of the 
listener ears in the sound field. A matrix H(z) of inverse digital 
filters can be used to process conventional two-channel stereophonic 
recorded signals prior to transmission by the loudspeakers in order 
to produce the desired effect, as specified by a matrix A(z) at 
microphones placed in a sound field. Matrix ACz) is the transfer 
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function matrix relating the recorded signals to the signals that 
are desired to be reproduced at the microphones, and the matrix A(z) 
in accordance with the first aspect of the invention is selected by 
assuming that we have the said specified virtual source locations. 

• 

According to a second aspect of the present invention a sound 
reproduction system comprises at least two loudspeakers which are 
driven through a filter means by at least two channels of a sound 
recording, the filter characteristics of the filter means being so 
chosen as to create at the listener location a local sound field 
which is substantially equivalent to the local field that would 
result from playing the sound recording through the loudspeakers 
driven without filters and positioned at virtual source locations 
that are more widely spaced from one another than the actual spacing 
of the loudspeakers. 

Thus the filter means is arranged to create an impression at the 
listener location that the loudspeakers are more widely spaced than 
is actually the case. 

According to a third aspect of the present invention a sound 
reproduction system comprises four loudspeakers which are arranged 
at spaced-apart positions to create a sound field, first and second 
predetermined listener locations within the sound field, the 
loudspeakers being driven through filter means by two channels of a 
sound recording, the filter characteristics being so chosen as to 
create at both the first and second listener locations a respective 
local portion of the sound field which is substantially the same as 
the sound field portion that would be produced at that respective 
location by playing the unfiltered channels of the sound recording 
through a pair of loudspeakers positioned symmetrically with respect 
to the respective location. 

A fourth aspect of the invention is concerned with an adaptive audio 
system and to a method of updating the filter coefficients of the 
adaptive filter of the system. 



WO 94/01981 



PCT/GB93/01402 



- 3 - 

According to the fourth aspect of the present invention an adaptive 
audio system comprises an adaptive filter having a plurality of 
alterable filter coefficients, a processor implementing an algorithm 
which cyclically performs in turn a filtering operation utilising 
the- filter, and a filter updating operation to update the filter 
coefficients, in which the algorithm is arranged in a cycle thereof 
to adjust (if necessary) only a limited number of the filter 
coefficients before performing a filtering operation. 

This aspect of the invention enables the use of a filter with a 
relatively large number of coefficients whilst facilitating a high 
sampling rate. 

Preferably only one coefficient of the filter is adjusted in each 
cycle. 

The algorithm is preferably the LMS algorithm [2] or the filtered-x 
LMS algorithm [3]. In the latter case, the filtered reference 
signal employed can be calculated on a single-tap basis. 

It is shown hereafter that the sparse update implementation method 
of the fourth aspect of the invention reduces the operation count by 
a factor of 2 in the case of the LMS algorithm and by a factor of 3 
in the case of the filtered-x LMS algorithm. More importantly, the 
bulk of the computational work that remains to be done is mainly 
related to the actual filtering operation. These calculations can 
be performed by a dedicated filtering unit external to the main 
processor and then most of the processing time can be dedicated to 
update the filter. This technique can be easily extended to complex 
systems based on a number of adaptive filters having arbitrary 
lengths. This is an ability which is highly desirable in 
multi -channel applications, such as those of the first, second or 
third aspects of the invention. 
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INTRODUCTION 

Dieital filters can be used to operate on recorded signals prior to their 
SSsmission via loudspeakers in order to enhance the reproduction of those 
In the simplest case, an inverse filter can be designed » order to 
compensate for deficiencies in a loudspeaker/room frequency response. Such a 
fTJZ L designed to produce a . ^-sfer Wtion ^een * e hi e r mpu and 
the sound pressure at a point in the sound field which has a flat ^agnituae 
response and a linear phase response; in time domain terms an impulse appl ed 
S the inpTcan be "almost perfectly" reproduced (at a little time later) at the 
outnut The Purpose of the work presented here however, is to demonstrate that 
^princSe P rn be extended to the multi-channel case and to i ustrate the 
consfderaSe potential for the use of multi-channel inverse niters in the 
rep^oduSron JTsSuL In Section 1, the background to the filter design problem 
S P rev!eweS briefly before the solution to the multi-channe problem is 
Resented The filter design problem falls naturally into a least squares 
Fr^ewol eve'n though filters deigned with a least squares objective may no 
ultimately provide the best psychoacoustical oenefits Tne oasis 
design technique assumes that measurements can be made m the reprodu ced 
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sound field in order to compare the reproduced signals with the signals that are 
desired to be reproduced. The fact that the filters can therefore be designed 
adaprively opens up the possibility of tailoring the individual filters to the 
requirements of a particular listening space. However, the filters do not 
necessarily have to be adaptive and modified to accommodate changes in 
Sag room acoustics; the design process can equally well be used in order to 
establish fixed filters used, for example to modify the position of stereophonic 
SSes^i an hi-car entertainment system. It is the latter topic that we address in 
2, where computer simulations are presented which demonstrate some 
uWesting possibilities. In particular, it is shown that a matrix of »v«?W*™ 
Sn be used to provide "virtual source" locations at positions other than those of 
ft? actual loudspeakers. Furthermore, the possibility is examined of providing 
multiple "ideal listening positions"; it is demonstrated that an appropriately 
designed filter matrix can be used to operate on the two channels of a 
stereophonic recording in order to closely reproduce these signals at two , pair of 
points in the reproduced field. The ultimate test of these possibilities will of 
course le psvchoacoustical. In the meantime however, attention will be 
concentrated on the physical aspects of the problem. 

1. MULTI-CHANNEL INVERSE FILTERING USING A LEAST SQUARES 
FORMULATION 

1.1 Background to least squares filter design 

The design of digital filters for single channel equalisation is most readily 
Ipproach°ed using°traditional "least squares" methods. This technique has its 
roots in the classfcal approach of Wiener (1) in which the impulse response of a 
futer is constrafned to be causal and designed in order to minimise the time 
averaged squared error between the filter output and the "desired" filter output 
The governing equations which have to be satisfied to ensure an optimal. design 
are easier to handle when working in discrete (rafter than continuous) time and 
the least squares method has become a standard technique m digital filter design 
Furthermore, the LMS algorithm of Widrow and Hoff [2 provides a* , eftont 
numerical technique for rapidly adapting the coefficients of an FIR digital JUter to 
provide the optimal impulse response. In acoustics, this algorithm requires ^a 
further modification before it can be utilised. In the loudspeaker equahsanon 
problem, for example, the output from the filter to be designed 1 passes through 
The electroacoustic path between the loudspeaker input and the point m space at 
which equalisation is sought. This additional transfer function has to be 
Tccounted for when using The LMS algorithm and the appropriately modified 
version has become known as the "filtered-." LMS algorithm as described by 
Widrow and Stearns [3]. The algorithm was first proposed by Morgan [4] and 
independently for use in feedforward control by Widrow [5 and fa, - the ^active 
control of sound by Burgess [6]. In many problems mvo vmg the active control 
of sound and vibration it is often necessary to use multiple inputs and *o ensure 
that the control filters are designed to ensure the mmimisation ^ som 
appropriate measure of error at multiple points m space [7]. This ^r?™™ 
led P ElHott and Nelson [8] to generalise the filtered-* LMS I algorithm to .deal wUh 
multiple errors. The resulting algorithm has become known as ^e Multiple 
Error LMS algorithm {9] and it has been extensively utilised in a vane./ ot 
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applications involving the active control of sound and vibration (see Nelson and 
Elliott (10] for a full account). 

The Multiple Error LMS algorithm has been still further generalised and 
specifically applied to problems in audio system equalisation by Nelson et al [11, 
StotKt^S*, the formulation of the problem was generalised to incorporate 
multiple input signals. This is the case found in stereophonic reproduction for 
3 Xre the algorithm has since been applied with considerable success 
STS genial formulation of the least squares filter design problem wi I again 
be presented here before describing further potential applications of the 
technique to the reproduction of sound. 

1.2 Solution of the multi-channel problem 

The multi-channel problem is illustrated in block diagram form in Figure , 1 
Working in discrete"^ time, we have K recorded signals x k M comprising the 
vector x(n) These are transmitted via M loudspeaker channels whose inputs are 
riven by y jn) which are the elements of the vector y(n). The resulting signals 
Ire transmuted via a matrix C(z) of electroacoustic : transfer functions and 
detected by L microphones whose outputs are given by z L (n), these being the 
elements of the vector z(n). We introduce a K x M matrix H(z) of FIR digital 
futers which operates on the K recorded signals prior to 

loudspeakers. The coefficients of the filters in the matrix are designed £™tato 
minimise the weighted sum of the mean squared values of the error signal 
Sn) The rth error signal is defined as the difference between the destred signal 
dfin) and the reproduced signal zfin). The desired signals «-,(n) <~PJ«g*J 
vector d(n» are in turn specified by passing the recorded signals x k (n) through a 
K x L matrix Ad) of filters. The filter matrix A(z) thus ^ffj^/ s ^ 
signals. Whilst this is the most general method of specifying the desired signals 
the elements of A(z) will in general include appropriate "modelling delays such 
that mS? desired signals are° in some sense delayed versions of the recorded 
signals. This is clearly necessary if reductions in mean s q u * red *™ * £ 
achieved when the elements of C(z) are non-minimum phase. The way in 
which A(z) may be specified will become clearer in the next section. 

The problem of finding the optimal coefficients of the fi Iters mHW will nowbe 
addressed As in the single channel case when developing the filtered-* LMb 
algor! hm the analysis is g°reatly assisted by effective reversal of 
transfer functions H(z) and C(z). This leads to the 

reference signals" r lmk W which are the signals produced by passing * th 
recorded signal through the l,m th element of the matrix C(z). Thus the signal 
z/(n) can be expressed as 

K M 1-1 

Zf(n)= X I S h mk (i)r lmk (n-i) W 
k = 1 m = 1 i=0 

where h m Ui) is the fth coefficient of the FIR filter whose input is the k'th 
recorded signal and whose output is the input to the m'th loudspeaker Each 
filter is assumed to have an impulse response of I samples in duration. In vector 
notation we can write equation (1) as 



WO 94/01981 



PCI7GB93/01402 



- 7 - 



z,(«) = hT(o) r,(n) + hHD r,(n - 1) . . . UQ - 1) r, (n - 1 + 1) 



(2) 



where we have defined a composite tap weight vector and a reference signal 
vector respectively by 

mi) = [MO fcaiO • • *mi<0 I fc« » *3B» ■ • WO I kirfO • • *mk<0] (3) 

r,T(n) = [rin<") H21 <«> ■ ■ r /ml("> I r l2(«> r l22<"> • • 

• • r m2 in) I r nK («) ife* (n) - . r, MK (») J W 

A further composite tap weight vector can be defined which consists of all the I 
tap weights of all the L x M filters. Thus is given by 



Wr=[U(o)bni)..hT(l-D] 
The L'th order vector of error signals can now be written as 

e(n) = R(n)w - d(n) 
where the matrix R(n) of filtered reference signals is given by 



(5) 



(6) 



R(n) = 



ri r («) r^H-D - r^Oi-f + D 
r 2 T(n)r 2 T («-l)..r2 T (»- / + 1) 

r L T(it) rjin - 1) . - - rjin - I + 1) _ 



(7) 



With the vector of error signals defined by equation (6), we can now proceed to 
determTn " he optimal value of the composite tap weight vector w whjch 
minimises the sum of the squared error signals. 

oroblem somewhat by minimising a cost function which allows for differentia 
w^ ghting o? the squared errors (which may be important in some ^cation^ 
rna S also S penalises\he "effort" used in arriving at the optimal solution^ The 
latter strategy may prove useful in the event of the problem becoming ill- 
SStSf wheJuttle reduction in error is achieved at the expense of large 
values of the filter coefficients. Thus we define a cost function given by 

/ = E[eT(n) W e e(n> + wHn) W„ w(n)] (8) 

where W e and W w are (generally diagonal) weighting matrices and E[ 1 denotes 
the expectation operator. The minimum of this *™ d ™*™£Zd «£^ 
bv first substituting the expression for e(n) given by equation (6) and then settin 
the gradient of / with respect to w equal to zero. Thus assuming W e is 
symmetric, / can be written as 

J = E[dT(„) W e din) - 2 wTRT(„) w, din) + w r (R T (n) W c R(n) + W w )w] (9) 
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The gradient of / with respect to w, also assuming that Vi w is symmetric, can be 
written as 

J^ = 2e[(RT-(h) W c R(n) + W> - R?"(n) W e d(n)] (10) 

Thus the solution that ensures that dj/d-w is zero is given by the optimal tap 
weight vector 

w 0 = (E[R^(n) W e R(n) + wj)" 1 |e[R t («) W e d(n)]} (11) 
The corresponding minimum value of / is given by 

J 0 = E[d^(n) W e d(fi)] - E[dT(n) W e R(«)]w 0 (12) 

The optimal tap weight vector w 0 can clearly be found by inversion of the matrix 
E[R r (n) W e R(n) + Wn,] which must be positive definite for a unique minimum 
to exist. In the case when W e = I (the identity matrix) and W w = 0, this matrix 
has a block Toeplitz structure and efficient numerical schemes exist for its 
inversion [14]. The other approach is to use the Multiple Error LMS algorithm. 
This has its origin in the method of steepest descent in which the minimum of 
the function is found iteratively by updating the coefficient vector w by an 
amount proportional to the negative of the gradient of the function. First note 
that using equation (6) in equation (10) allows the gradient to be written as 

|k = 2E[R T (n) W e e(n) + W w w] (13) 

Following Widrow and Hoff [21, we now make the assumption that the filter 
coefficients are updated by an instantaneous estimate of this gradient, which is 
given by dropping the expectation operator in equation (13). Thus the tap weight 
update equation becomes 

w(m + 1) = w(n) - a W v w(n) - a R T (n) W e e(n) ( 14 > 

where a is a convergence coefficient. Equation (14) thus specifies a simple and 
readily implemented algorithm for iteratively converging to the solution for the 
optimal coefficient vector. As pointed out by Elliott et al [9], the effect of the 
"effort" weighting V4 w can be shown to be equivalent to the "leaky LMb 
algorithm in° which case in the absence of an error term e(n), the coefficient 
vector w would decay away to zero. Note that the implementation of the 
algorithm requires the generation of the filtered reference signals r /mJt (n) which 
comprise the elements of the matrix R(n). These can be generated by passing the 
recorded signals x k {n) through FIR filters which give an estimate of the transfer 
function C fm (z). These in turn can be identified by using a broadband training 
signal passed through the m*th loudspeaker to the fth microphone with the 
LMS algorithm used to adapt the filter coefficients. 
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13 Relationship of the least squares solution to methods for finding exact 
inverse filters 

An alternative approach to inverse filtering in room acoustics is that proposed by 
Myoshi and Kaneda [15] in the form of the Multiple-Input /Output Inverse 
Filtering Theorem (MINT). In that work, it is demonstrated that a pair of filters 
can bk designed which can be used on the inputs to two loudspeakers which are 
both used to transmit a given recorded signal to a specific point in space. The 
filters can be designed to ensure that "perfect" equalisation of the transmission 
path is produced, even when the transmission paths between the loudspeaker 
inputs and the point where equalisation is required are non-minimum phase 
(after an appropriate bulk delay has been subtracted; a point which is not 
altogether clear from Myoshi and Kaneda's paper [15]). An analysis of the 
relationship between MINT and the least squares approach presented above has 
been presented by Nelson et al [16]. This leads to a useful result which may have 
some bearing on the choice of the values of K, M and L in a given application, 
together with number of coefficients / used in the inverse filters. 

It is assumed at the outset that the transmission paths C/ m (z) can be adequately 
represented by FIR filters having j coefficients. It can then be shown that to 
produce exact equalisation of the transmission channels from K recorded signals 
to L = K points in space requires that the number of coefficients in the inverse 
filters is given by 



The full derivation leading to this result is presented in reference [16]. However, 
it should again be emphasised that the choice of I given by equation (15) ensures 
the exact equalisation of the J-coefficient FIR filters representing the transmission 
paths C im (z); equalisation of the real transmission paths therefore assumes that 
all these paths are exactly represented by /-coefficient filters. Nevertheless, the 
analysis presented in reference [16] gives some indication of the number of 
coefficients 1 required in the inverse filters and demonstrates furthermore that in 
order to realise an exact inverse, it is required that M > L (i.e. a greater number of 
loudspeakers is required than the number of points at which equalisation is 
attempted). This result is therefore consistent with the work of Myoshi and 
Kaneda [15] who show, for example, in the case M = 2, L = 1, that / = (/ - 1). The 
work presented in reference [16] generalises Myoshi and Kaneda's result and also 
suggests that the Multiple Error LMS algorithm can be used to find the required 
solution for the coefficients of the inverse filters. 




(15) 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a block diagram for the analysis of multi-channel 
inverse filtering problems in sound reproduction, 

Figure 2 shows a geometrical arrangement of real sources (SI, S2) 
whose outputs are prefiltered in accordance with the invention in 
order to produce signals at microphones (Ml, M2) which appear to 
originate from virtual sources (VI, V2). 

Figure 3 shows the impulse responses from inputs x 1 (dashed line) 
and x 2 (solid line) to the two microphones shown in Figure 2 before 
(Control = OFF) and after (Control = ON) the introduction of the 
filter matrix H used to pre-filter the outputs of sources SI and S2. 
The relative arrival times of the impulses at the microphones are as 
if they originated from virtual sources VI and V2. A modelling 
delay of 6,25 ms included during the simulations has been removed 
from the prefiltered responses (lower plots). 

Figure 4 shows the impulse responses of the elements of the filter 
matrix H designed to pre-filter the outputs of sources SI and S2 
shown in Figure 2. The dashed lines show the impulse responses of 
the filters operating on input x 1 and the solid lines show the 
impulse responses of the filters operating on input x 2 - 

Figure 5 shows frequency response functions associated with the 
composite system H(z)C(z) whose impulse response is illustrated in 
the lower traces of Figure 3. The phase responses have been 
calculated relative to that associated with the first arrival in the 
time domain. Note the small difference in magnitude response and 
the substantial difference in phase response corresponds to that 
which would be expected to be produced by the virtual source 
locations shown in Figure 2. 
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Figure 6 shows a further geometrical arrangement of real sources 
(SI, S2) whose outputs are prefiltered in accordance with the 
invention in order to produce signals at microphones (Ml, M2) which 
appear to originate from virtual sources (VI, V2). 

Figure 7 shows impulse responses from inputs (dashed line) and x 2 
(solid line) to the two microphones shown in Figure 6 before 
(Control = OFF) and after (Control = ON) the introduction of the 
filter matrix H used to pre-filter the outputs of sources SI and S2. 
Note the increased time between the arrivals of the impulses 
produced by the location of the virtual sources VI and V2. 

Figure 8 shows impulse responses of the elements of the filter 
matrix H designed to pre-filter the outputs of sources SI and S2 
shown in Figure 6. The dashed lines show the impulse responses of 
the filters operating on input x 1 and the solid lines show the 
impulse response of the filters operating on input x 2 . 

Figure 9 shows frequency response functions associated with the 
composite system H(z)C(z) whose impulse response is illustrated in 
the lower traces of Figure 7. The phase responses have been 
calculated relative to that associated with the first arrival in the 
time domain. Note the small difference in magnitude response and 
the substantial difference in phase response corresponds to that 
which would be expected to be produced by the virtual source 
locations shown in Figure 6. 

Figure 10 shows a source and microphone layout used in simulating 
the production of multiple stereo images- The two signals x 1 and x 2 
are filtered in accordance with the invention with a matrix H(z) 
prior to transmission via sources SI - S4. 

Figure 11 shows impulse responses of the composite system H(z)C(z) 
for inputs x 1 (solid line) and x 2 (dashed line) when HCz) is 
designed to ensure that x-j is reproduced at microphones Ml and M3 
and that x ? is reproduced at microphones M2 and M4 (see Figure 10). 
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Fiqure 12 shows frequency response functions corresponding to the 
impulse responses of Figure 11. 

Figure 13 shows the impulse response of the composite system 
H(z>CCz) when x ] (solid line) and x £ (dashed line) are filtered by a 
2x2 matrix H(z) prior to transmission via sources SI and S2 in 
Figure 10. H(z) is designed to ensure that x ] and x 2 are reproduced 
at Ml and M2 respectively. Note the degradation of the response at 
M3 and M4. 

Figure 14 shows frequency response functions corresponding to the 
impulse responses of Figure 13. 

Figure 15 shows impulse responses of the composite system H(z)C(z) 
when x 1 (solid line) and x 2 (dashed line) are filtered by a 4 x 2 
matrix H(z) prior to transmission via all four sources in Figure 10. 
H(z) is designed to ensure that x 1 and x 2 are reproduced at Ml and 
M2 respectively. 

Figure 16 shows frequency response functions corresponding to the 
impulse responses of Figure 15. Note the improvement in cross talk 
cancellation at Ml and M2 compared to the results of Figure 14. 

Figure 17 shows at (a) a block diagram for the adaptive system 
identification problem. The filter G must be adapted so that its 
output approximates the output from the plant, and at (b) a block 
diagram for the adaptive system equalization problem. The filter H 
must be adapted so that the output from the plant approximates the 
output of a given target system which, in general, includes a 
modelling delay that ensures the existence of a causal optimal 
filter. 

Figure 18 shows the frequency response function of a loudspeaker in 
an anechoic chamber. Unequalized (dashed line) and equalized using 
in accordance with the invention the sparse update adaptive 
algorithms (solid line). 
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2 POTENTIAL APPLICATIONS OF MULTI-CHANNEL TECHNIQUES IN 
THE IMPROVEMENT OF SOUND REPRODUCTION SYSTEMS 

2.1 The use of multiple errors in single channel response equalisation 

Of course the simplest equalisation problem corresponds to the : case where ^an 
inverse filter is introduced on the input to a loudspea ker and *e M te i 
designed to ensure that the signal reproduced at a point ^JffS^iS A 
possible (in the least squares sense) to a delayed version of the input signal. A 
simple example of such an approach is that presented by Kuriyama and 
Sam [17] who use the single channel filtered-x LMS algorithm to design a 
filter foTthe equalisation of th?on-axis response of a three-driver loudspeaker 
system ^(including cross-over networks). A 512 coefficient FIR filter operating at a 
sampTe rate of 32 kHz succeeded in producing a flat amplitude and linear phase 
response, between 200 Hz and 12 kHz. No results were reported however, of 
measurements of the response of the system at spatial positions other than those 
at which equalisation was achieved. This is a crucial issue m response 
equalisation problems; the effect of the equalising filter on the global 
Performance of the system. This problem has been addressed by Wilson [18] m 
fhe context of loudspeaker equalisation. It was demonstrated thai : nummismg a 
weighted sum of squared errors derived from measurements of both on- and off- 
^responses was successful in improving the off-axis response also, albeit at the 
expense of the improvements in on-axis response. Similar comments apply to 
rSm response equalisation. Farnsworth et al [19], for example used computer 
su^afions of an enclosed sound field to investigate the effect of equalising the 
Session response to one point in a room on the responses at other points in 
S Z The results predicted a potentially severe degradation of response 
outside™ zone surrounding the equalisation points. The subjective response to 
SSs effect, however, has yet to be quantified. An attempt to ensure global 
eolation" was made by Elliott and Nelson [20] who also used I computer 

simulations to study the equalisation of the low frequency acoustic field in a car 

nterior In that ca'se, multiple errors were 

signals bein<* specified at a number of points in the sound field. These desired 
sifnals weredelaved versions of the input signal with the delays chosen to be 
eq°ual to the 'acoustic propagation delays from the simulated loudspeaker source 
to the po nts at which the desired signals were specified. This strategy predicted 
"globafequalisation" in the sense that the frequency respons e was improved at 
all the equalisation points, but of course was not made perfect at any one point. 
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2.2 Response equalisation in stereophonic reproduction 

Rather than attempting (the unrealisable) goal of ^^^^S^ 
single channel system (which is then subsequently used in * ^tereopnomc 
epfodurtion system), a more realistic approach may be * 

SSTS equalisation techniques. One such applicauon is described in 
deiail by Nelson et al [13] where a 2 x2 matrix of filters H(z) was used to process 
two recorded signals prior to transmission via two loudspeakers The desired 
sfenaTs were specified at two points in space, these being at the location ot the 
eals o a huma P n listener. These desired signals were specified as simply r delayed 
versions of the "left" and "right" channels ^fT.^^ 
equalisation filters had been adapted, the left ear o the listener would perce ve 
only the recorded signal from the left channel and the right ear would ]?ej ce ^ e 
on y the horded s f gn al from the right channel. The inverse ™ er 
effective in canceHing°the "cross-talk" between left loudspeaker and I ngh t ear 
vice-versa in addition to equalising the frequency responses ot both 
J££L? As'such, the syslen .represents digital imp ^™ t * 
system suggested by Atal and Schroeder 121], the impressive suojecn 
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capabilities of the original analogue implementation being graphically described 
by Schroeder [22]. Such a system is well suited to the reproduction of binaurally 
recorded sound fields, such as those recorded with the use of an artificial head. 

However most modern recordings do not use such techniques and the typical 
two-channel stereophonic recording is a carefully mixed amalgam of individual 
signals with left and right channels attributed with signal components in order 
to maximise the effective stereophonic illusion, largely in accordance with the 
subjective judgements of the producer of the recording. If one accepts that such 
recording techniques will not readily be changed, the potential for the further 
exploitation of equalisation techniques becomes focussed on providing realisable 
improvements in existing reproduction systems. One such improvement that 

can be obtained in accordance with the invention is to use adaptive 
filters to compensate for practical deficiencies in loudspeaker 

location relative to a listener. In the case of in-car 
entertainment systems for example, it is very difficult to locate the loudspeakers 
symmetrically with respect to the listener and thus produce the stereophonic 
illusion initially perceived by the recording engineer. The same may be true of 
many domestic listening environments. With appropriately designed filters, 
however, the loudspeakers can effectively be shifted to "virtual locations" which 
apparently seat the listener in the optimal location. 

The geometry used in computer simulations of this approach in 
accordance with the first aspect of the invention is shown nn Fngure 
2. The matrix C(z) which relates the signals output from sources SI and S2 to 
microphones M l and M 2 (placed at the location of the listeners ears) is given by 









r\2 




Z-A22 


- r 21 


r 22 



(16) 



where these transfer functions are the digital versions of the continuous time 
transfer function relating the pressure at a point in space to the volume 
acceleration of a point monopole source. Thus the delays A m/ are given by 



A m/ = round (/ s r mi /cj 



(17) 



where r m , is the distance between the m'th source and the i'th microphone, f s is 
the sampling frequency and c 0 is the sound speed. The matrix A(z), which 
specifies the relationship between the recorded signals and the desired signals is 
selected by assuming that we have certain "virtual source" locations VI and V2, 
as illustrated in Figure 2. This ensures that the desired signals <*,(n) and d 2 (n) are 
those that would be produced by sources VI and V2. Thus 



A(z) = 



p 0 z- & morf 



4ic 



»n "12 

Z - *2i Z~Tfi2 
- "21 "22 J 



(18) 
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where ^mcA is a modelling delay and the delays y m{ are given by 

y ml = round(/ s u m i/c 0 ) * 19) 

where u m , is the distance between the m'th virtual source location and the I'th 
rrSophone. In all the simulations that follow, the sample rate f s was chosen to 
8 kHz with c n = 341 ms-i and p 0 = 1.0 kg/m* for simplicity. The number of 
cLffi^trlnme filter matrix Hi* was always chosen to be ****** Wters 
w?re SSjied adaptively using pseudo-random sequences x,C*> and x 2 («). 

fc ^es Tl^e^ 

sources SI and S2 are no [ omy piac y i g ^ result 

iKine the algorithm of equation (14) with W w = 0andW e = I. The corresponain 0 

response plots shown in Figure o. These in particular snow no 

Also note that the magnitude responses are consistent with ^ d^ffere nee m 

be accounted for in the implementation of a real system. 
In accordance with a second aspect of the invention 

this approach with the geometry of the real ana virn» _ DDlied via the 

In particular, the time between arrivals at a given responS es of the 

consistent with the location of the ™"£"~^8^ 15™9 shows the 

filters necessary to accomplish this are shown* ^Rgure8, and Hg 

frequency response functions of the ^^^Sa^ ^ the specified 

illustrate the magnitude and phase differences associate 

virtual source locations. 

2.3 The production of multiple stereophonic images 

In accordance with a third aspect of the l"^"^ design philosophy 
yet another possibility that emerges from the ^ f a conventiona l 

Outlined above, is that of ^^^JJ^iS source locations for two 
stereophonic recording in order to produce ideal virtua 
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independent listening positions. Such an approach does however require the 
use of at least four loudspeakers. As an illustration of this possibility, here we 
present the results of some computer simulations (first presented by Orduna- 
Bustamante et al [23]) where the source /microphone arrangement of Figure 10 is 
considered. Also, for the purposes of illustration, we will deal with the "cross- 
talk cancellation" case where, for example, we wish to reproduce two recorded 
signals x^n) and x 2 (n) at Ml and M2 and at M3 and M4. In this case, the matrix 
C(z) takes the form 



<=»-& 



V*ll /r n ) (r-Ai2/r 12 ) (z~ Al3 /r 13 ) (zr*U/r w 

(z-A2l/r 2 i) (r-*22/r 2 2> (r^a/r^) (z~A24/r 24 ) 

(2-A31 /r 31 ) (r-A32/r 32 ) (z- A 33/r 33 ) (r-A34/r 34 ) 

(z-^4l/r 41 ) (z-A42/r 42 ) (z-*43/r 43 ) (z-*44/r 44 ) _ 



(20) 



and the matrix A(z) which defines the desired signals is given by 



A(z) = z~&mod 



1 0 

0 1 

1 0 
.0 I. 



(21) 



where & mod is a modelling delay. In the simulations presented here, the 4x2 
matrix H(z) was comprised of eight FIR filters each having a number of 
coefficients J = 128. The modelling delay A mod was chosen to be 96 samples. 

The resulting impulse responses of the system consisting of H(z) C(z) are 
illustrated in Figure 11 and the corresponding frequency response functions are 
shown in Figure 12. A good degree of cross-talk cancellation is clearly evident, 
although the system becomes less effective at certain frequencies where the 
inverse filters have difficulty in modelling what amount to poles of the inverse 
system (see Nelson et al [13] for a fuller discussion). The use of IIR filters may 
provide a solution to this problem and a preliminary investigation is reported in 
reference [24]. The use of IIR filters in the single channel case has also been 
examined by Greenfield and Hawksford [25]. This problem is more clearly 
illustrated in the simple case of the reproduction of signals x^n) and x 2 (n) at Ml 
and M2 respectively. The results of implementing such a system are illustrated 
in Figures 13 and 14. Effective cross-talk cancellation is produced (except at 
certain frequencies) at microphones 1 and 2, whereas the signals at microphones 
3 and 4 are clearly degraded by the operation of the system, as one would expect. 
Finally, the potential advantages in using more sources than microphones in the 
sense described in Section 2.3 is illustrated by the results presented in Figures 15 
and 16. Here, the 4x2 matrix H(z) has been designed to ensure reproduction of 
Xt(w) and x 2 (w) at microphones 1 and 2. Considerably improved cross-talk 
cancellation can be seen to have been achieved. 
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SPARSELY UPDATED FILTERS FOR ADAPTIVE DIGITAL PROCESSING OF AUDIO 
SIGNALS 

A sparse update strategy will now be described that allows the 
implementation of adaptive filters at high sampling rates using 
existing DSP technology. This technique has the important property 
that the processing time per sampling period spent in filter update 
operations is independent of the filter length. Code will be given 
for both the LMS and the filtered- x LMS algorithms and description 
of their practical use for loudspeaker equalisation, 

INTRODUCTIOR 

The single-channel system identification problem is shown in Figure 17(a). A vector 
containing the L most recent samples of the input signal is defined as 

x„=[ jc« x*-i ... ] T ; 

similarly, a vector containing the coefficients of a (non-stationary) non-recursive digital filter is 
defined as 

S«=[ £o(«) gito ... ^wW ] T - 
The dot product of the input vector with the coefficient vector produces the output 

y*. = glx*. 

This is required to approximate the desired signal d m (the output from the system under 
identification), in a way that minimizes the variance of the error signal defined by 

The LMS algorithm [3] performs a stochastic gradient search in /^dimensions according to the 
formula 

which can be shown to converge on the mean to the exact least mean squares solution of the 
problem provided the adaptation rate is chosen to satisfy the following condition 

\l<UU5l 

The implementation of this algorithm in a microprocessor requires 4L arithmetic operations 
(2L to perform the filtering and 2Z, to update the filter). At high sampling rates this can prove 
very taxing and heavy restrictions must be imposed on the number of coefficients. 
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The system equalization problem is shown in Figure 17(b). In this case, a new filter with 
coefficients 

h* = [/*o(«) fain) ... fck-tOO ] F 
acts on the input signal to produce the signal 
y n - hlx H 

which after transmission through the system produces an output z H that minimizes the variance 
of a new error signal defined as 

The coefficients of the optimal filter can be searched using the filtered-* LMS algorithm [9] 

h„ + i =h (f +|i£„r ) ,; 
where the vector containing recent samples of the reference signal 

r„ = g r x„, 

is used instead of the vector of input signals as in the LMS algorithm. This increases the 
arithmetic work by another 2L operations (assuming that both filters g and h have the same 
length). The arithmetic work thus increases to 6L operations. 

SPARSE UPDATE IMPLEMENTATION OF THE LMS ALGORITHM. 

One way of reducing the operation count required to update the filter coefficients 
using the LMS algorithm is to devise a criterion to select the absolute minimum number of 
operations that are still necessary to maintain the convergence properties of the algorithm. The 
minimum work that can be done is to update only one filter coefficient per sampling period. 
This can be implemented by performing die following operations at every processing cycle n 

*. 

gkin) = g*(rt) + lLe H x n -k , (update current filter tap) 
£ = l)modL; (increment tap counter) 

where k is a counter set initially to h=0 that runs circularly along the vector of filter 
coefficients. Note that because n and k are always incremented by 1 every cycle (except when 
k wraps around to zero), the input sample that is used for the update is exactly the same 
during the L cycles that it takes to perform one pass along the whole filter. This observation 
leads to the following alternative, but exactly equivalent, version of the sparse update version 
of the LMS algorithm 

if k =s 0 then a = nx„ (store, and pre-multiply, input sample) 

gt(n) = g tin) +ac H (update current filter tap) 

k = (Jfc + 1 )mod L (increment tap counter) 

By following any of these procedures, it takes L processing cycles to update the whole filter, 
but the operation count is reduced to 2L (basically those involved in the actual filtering). 
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SPARSE UPDATE IMPLEMENTATION OF THE FILTERED-X LMS ALGORITHM 

. A sparse update implementation of the filtered-x LMS algorithm presents the 
additional challenge of having to calculate the filtered reference signal in a way which is 
compatible with the sparse update of the filter coefficients. Interestingly enough the 
calculation of the filtered reference signal can be performed also on a single-tap basis as 
follows 

if k = 0 then r = 0 (clear filtered-x accumulator) 

r = r + gis-*-\x n (accumulate current product) 

if k = L - 1 then a = \ir (when done, store filtered-x) 
k = (Jfc + 1 )mod L (increment tap counter) 

h k (n) = hk(n) + CLen (update current filter tap) 

Note that the calculation of the next sample of the filtered reference signal starts L cycles in 
advance. To this end, the coefficients of the reference filter g are accessed in reverse, as 
shown, and the filtering makes use of the most recent input sample at every cycle (which, as it 
were, gets old by itself as times goes by). The operation count reduces again to just over 2L as 
in the sparse update implementation of the LMS algorithm. 



LOUDSPEAKER EQUALIZATION USING SPARSE UPDATE ADAPTIVE FILTERS. 

Figure 18 shows the frequency response function of a loudspeaker in an anechoic 
chamber equalized to obtain a flat magnitude response and a linear phase response. The 
processing was performed in floating point arithmetic using a Texas Instruments TMS320C30 
processor. The sampling frequency was set to /=32 kHz and the filter length to L=48. The 
impulse response function of the system was first identified using the sparse update version of 
the LMS algorithm. The system was later equalized using the sparse update implementation of 
the filtered-x LMS algorithm. (A full update could only be possible at/=16kHz or L=24.) 
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CLAIMS 

1. A sound reproduction system comprising a plurality of 
loudspeakers (SI, S2) for positioning at positions spaced from a 
listener location (Ml, M2), loudspeaker drive means (H) for driving 
the loudspeakers (SI, S2) in response to a plurality of channels of 
a sound recording (x), the loudspeaker drive means comprising a 
filter means (H), the sound recording (x) being suitable for playing 
normally through a plurality of speakers normally positioned 
generally symmetrically with respect to a listener location, 
characterised in that the filter characteristics of the filter means 
(H) are so chosen as to create, in use of the reproduction system, 
at the listener location (Ml, M2) a local sound field which is 
substantially equivalent to the local field that would result from 
playing the sound recording (x) through a plurality of loudspeakers 
driven without said filter means (H) and positioned at virtual 
source locations (VI, V2), at least one of the virtual source 
locations (VI, V2) being displaced substantially from the actual 
location (SI, S2) of the corresponding loudspeaker. 

2. A sound reproduction system as claimed in claim 1 characterised 
in that each of said virtual source locations (VI, V2) is displaced 
from the actual location (SI, S2) of the corresponding loudspeaker. 

3. A sound reproduction system as claimed in claim 1 or claim 2 
characterised in that the actual locations (SI, S2) of the 
loudspeakers, during use of the sound reproduction system, are 
predetermined positions that are asymmetric with respect to the 
listener location (Ml, M2), whereas said virtual source locations 
(VI, V2) created by the filter means (H) are substantially 
symmetrically positioned with respect to the listener location (Ml, 
M2). 

4. A sound reproduction system as claimed in claim 2 characterised 
in that the actual locations (SI, S2) of the loudspeakers, during 
use of the reproduction system, are predetermined positions that ar 
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more narrowly spaced from each other than the spacing of the virtual 
sources (VI, V2) created by the filter means (H) . 

5. A sound reproduction system comprising four loudspeakers (SI, 
S2, -S3, S4) which are arranged in use at spaced-apart positions to 
create a sound field, first and second predetermined listener 
locations (Ml, M2), (M3, M4) within the sound field, characterised 
in that the loudspeakers are driven through filter means (H) by two 
channels (x-j, x 2 ) of a sound recording, the filter characteristics 
of the filter means (H) being so chosen as to create at both the 
first and second listener locations (Ml, M2), (M3, M4) a respective 
local portion of the sound field which is substantially the same as 
the sound field portion that would be produced at that respective 
location (Ml, M2), (M3, M4) by playing the unfiltered channels of 
the sound recording (x^ x 2 ) through a pair of loudspeakers 
positioned symmetrically with respect to the respective location. 

6. An adaptive audio system comprising an adaptive filter having a 
plurality of alterable filter coefficients, a processor implementing 
an algorithm which cyclically performs in turn a filtering operation 
utilising the filter, and a filter updating operation to update the 
filter coefficients, characterised in that the algorithm is arranged 
in a cycle thereof to adjust (if necessary) only a limited number of 
the filter coefficients before performing a filtering operation. 

7. An adaptive audio system as claimed in claim 6 characterised in 
that only one coefficient of the filter is adjusted in each cycle. 

8. An adaptive audio system as claimed in claim 6 or claim 7 in 
which the algorithm is the LMS algorithm. 

9. An adaptive audio system as claimed in claim 6 or claim 7 
characterised in that the algorithm is the filtered-x LMS algorithm. 

10. An adaptive audio system as claimed in claim 9 characterised in 
that the filtered reference signal employed is calculated on a 
single-tap basis. 
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